Feature extraction and clustering for the computer-aided reconstruction of strip-cut shredded documents
نویسندگان
چکیده
bstract. We propose a solution for the computer-aided recontruction of strip-cut shredded documents. First of all, the visual conent of the strips is automatically extracted and represented by a umber of numerical features. Usually, the pieces of different pages ave been mixed. A grouping of the strips belonging to a same page s thus realized by means of a clustering operator, to ease the sucessive matching performed by a human operator with the help of a omputer. © 2008 SPIE and IS&T. DOI: 10.1117/1.2898551
منابع مشابه
Reconstructing Shredded Documents
This project looks at the challenges involved in the automatic reconstruction of strip (vertically cut) and cross (both vertically and horizontally cut) shredded documents. The unshredding problem is of interest in the fields of forensics, investigative sciences, and archaeology. All stages of the unshredding pipeline are analysed, starting from scanned images of shreds and ending with reconstr...
متن کاملAn alternative clustering approach for reconstructing cross cut shredded text documents
In this paper, we propose a clustering approach for solving the problem of reconstructing cross-cut shredded documents. This problem is important in the field of forensic science. Unlike other clustering approaches which are applied as a preprocessing step before the actual reconstruction algorithms, our clustering approach is part of the reconstruction process itself. We define a new cost func...
متن کاملA Memetic Algorithm for Reconstructing Cross-Cut Shredded Text Documents
The reconstruction of destroyed paper documents became of more interest during the last years. On the one hand it (often) occurs that documents are destroyed by mistake while on the other hand this type of application is relevant in the fields of forensics and archeology, e.g., for evidence or restoring ancient documents. Within this paper, we present a new approach for restoring cross-cut shre...
متن کاملAn Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملA New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Electronic Imaging
دوره 17 شماره
صفحات -
تاریخ انتشار 2008